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Abstract — All communication networks comprise of transmis- 
sion systems and switching systems, even though they are usually 
treated as two separate issues. Communication channels are 
generally disturbed by noise from various sources. In circuit 
switched networks, reliable communication requires the error- 
tolerant transmission of bits over noisy channels. In packet 
switched networks, however, not only can bits be corrupted 
with noise, but resources along connection paths are also subject 
to contention. Thus, quality of service (QoS) is determined by 
buffer delays and packet losses. The theme of this paper is to 
show that transmission noise and packet contention actually have 
similar characteristics and can be tamed by comparable means to 
achieve reliable communication. The following analogies between 
switching and transmission are identified. 

1. Buffering against contention is a process that is similar to 
the error correction of noise corrupted signals. A signal-to-noise 
ratio that represents the carried load of packet switches can be 
deduced from the Boltzmann model of packet distribution. 

2. When deflection routing is applied to Clos networks the 
loss probability decreases exponentially, which is similar to the 
exponential behavior of the error probability of binary symmetric 
channels with random channel coding. In information theory, this 
result is stated as the noisy channel coding theorem. 

3. The similarity between Hall's condition of bipartite match- 
ing and expander graph manifests the resemblance between 
nonblocking route assignments and error-correcting codes. An 
extension of the Sipser-Speilman decoding algorithm of expander 
codes to route assignments of Benes networks is given to illustrate 
their correspondence. 

4. Scheduling in packet switching serves the same function 
as noiseless channel coding in digital transmission. The smooth- 
ness of scheduling, like source coding, is bounded by entropy 
inequalities. 

5. The sampling theorem of bandlimited signals provides 
the cornerstone of digital communication and signal processing. 
Recently, the Birkhoff-von Neumann decomposition of traffic 
matrices has been widely applied to packet switches. With respect 
to the complexity reduction of packet switching, we show that the 
decomposition of a doubly stochastic traffic matrix plays a similar 
role to that of the sampling theorem in digital transmission. 

We conclude that packet switching systems are governed by 
mathematical laws that are similar to those of digital transmission 
systems as envisioned by Shannon in his seminal 1948 paper, A 
Mathematical Theory of Communication. 

Index Terms — Clos network, complete matching, channel cod- 
ing, source coding, sampling theorem, scheduling 



I. Introduction 

ALL communication networks comprise of transmission 
systems and switching systems, even though they are 
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(a) Packet switching with contention (b) Transmission channel with noise 



Fig. 1. The parallel characteristics between packet switching and digital 
transmission 

usually treated as two separate issues. Communication chan- 
nels are generally disturbed by noise from various sources. 
In circuit switched networks, resources are dedicated to con- 
nections and reliable communication only requires the error- 
tolerant transmission of bits over noisy channels. In packet 
switched networks, however, not only can bits be corrupted 
with noise, but resources along connection paths are also 
subject to contention. Despite the great achievements of infor- 
mation theory in dealing with transmission noise [l]-[3], there 
are still many networking problems in modern communication 
systems that information theory is unable to provide solutions 
for. A particular problem that has come to the fore is the delay 
that is induced by contention in packet switched networks, the 
solution for which lies in extending the theory to go beyond 
the bit-rate of transmission [4]. 

The theme of this paper is to show that transmission noise 
and packet contention actually have similar mathematical 
characteristics and can be tamed by comparable means to 
achieve reliable communication. The source information of 
the transmission channel is a function of time, and errors are 
corrected by coding scheme, which is an expansion of signal 
space. For switching systems, source information is a space 
function f(i) = j, for i = 1, 2, N form inputs Vi to outputs 
V , which represents a set of connection requests. Packets lost 
in contention are usually buffered or deflected to stretch out 
time and defer their requests. As shown in Fig. Q] the two 
processes are anti-symmetric with respect to time and space 
and they are both governed by the law of probability. 

In transmission systems, the fundamental QoS parameter 
is bit-rate, or channel capacity, which can be pinpointed 
by signal-to-noise ratio. In packet switched networks, QoS 
parameters, buffer delay and packet loss, are all determined by 
loading. In order to provide a common ground for comparison, 
the carried load of a packet switch is converted into a pseudo 
signal-to-noise ratio (PSNR) induced by Boltzmann's model 
of packet distribution. Based on this correspondence between 
carried load and PSNR, we show that a packet switched Clos 
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Fig. 2. Comparison of transmission systems and switching systems 



network with random routing can be modelled as an abstract 
channel with additive Gaussian noise. 

A schematic diagram of transmission channel and a three- 
stage Clos network [5] are shown in Fig. [2] The input 
modules in the first stage and output modules in the third 
stage of the Clos network correspond to transmitters and 
receivers, respectively, of the transmission system. The number 
of central modules is the bandwidth of the Clos network. 
The disturbances due to internal contentions in the middle 
stage of a packet switched Clos network mimic the noise of 
a channel. In this paper, we demonstrate that various routing 
schemes applied to a packet switched Clos network to cope 
with contentions are comparable to coding schemes of a 
transmission channel. The routing schemes of packet switching 
and their counterparts in digital transmission are listed in Fig fj] 
Depending on the implementation of routing schemes, and 
similar to the separation of channel coding and source coding 
in information theory, the Clos network can either be modelled 
as a noisy channel with nonblocking routing, analogous to 
channel coding, or a noiseless channel with route scheduling, 
analogous to source coding, as explained below. 

The noisy channel model of the Clos network. We first 
compare the deflection routing of packet switching with the 
random coding over noisy channel. Errors introduced by noises 
in transmission are subdued by redundant bits, while packet 
contentions can be relieved by redundant links in switching. 
When deflection routing is applied to Clos networks, the loss 
probability decreases exponentially with respect to the network 
length, which is similar to the exponential behavior of the error 
probability of binary symmetric channels with random channel 
coding. In information theory, this result is stated as the noisy 
channel coding theorem [1], [2], [6], [7] . 

In contrast to deflection routing, nonblocking routing in 
the Clos network will avoid internal contentions completely 
by conflict-free route assignments according to input requests 
[8]-[10]. The route assignment algorithms are based on the 
matching of bipartite graphs. In the error-correcting code, the 
low density parity check (LDPC) code developed by Gallager 
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Fig. 3. Analogies between packet switching and information transmission 



can also be represented by a bipartite graph [11], [12], and a 
subclass called the expander code was constructed by Sipser 
and Speilman [13]. The condition on the expander graph is 
a generalization of Hall's condition on complete matching, 
which suggests the resemblance between nonblocking route 
assignments and error-correcting codes. An extension of the 
Sipser and Speilman decoding algorithm of expander codes 
to route assignments of Benses networks is given to illustrate 
their correspondence. 

The noiseless channel model of the Clos network. A 
drastically different approach, called path switching, to deal 
with contention in a Clos network is proposed in [14]. A set 
of connection patterns of central modules is determined by the 
traffic matrix decomposition for this purpose. Path switching 
periodically uses this finite set of predetermined connections 
in the central stage of the Clos network to avoid online com- 
putation of route assignments. Once the connection patterns of 
central modules are fixed in each time slot, incoming packets 
can be scheduled accordingly in the input buffer. Regarding 
the predetermined connections as a code book, scheduling is a 
process similar to source coding in digital transmission [1], [2], 
[6], [7]. The smoothness of scheduling, like noiseless coding, 
is bounded by entropy inequalities. 

The decomposition of traffic matrices, sometimes called 
the Birkhoff-von Neumann decomposition, has been widely 
used in SS/TDMA satellite communications [15], [16]. Path 
switching adopted this scheduling scheme in packet switching 
systems to guarantee the capacity of virtual paths in Clos 
networks. The same approach was applied to the crossbar 
switch, also called the Birkhoff-von Neumann switch, in 
[17]. Mathematically, the series expansion and reconstruction 
of doubly stochastic capacity matrices are similar to the 
Fourier series expansion and interpolation of sampling theorem 
of bandlimited signals [18]— [21]. They also serve the same 
function in terms of complexity reduction of communication 
systems. The capacity matrix decomposition employed in path 
switching will reduce the dimension of permutation space of 
a Clos network from AM to 0(N 2 ) or even lower, while 
the sampling theorem in transmission reduces the infinite 
dimensional signal space of any duration T to a finite number 
of samples. 

The remainder of the paper is organized as follows. In 
section II, we propose the definition of pseudo signal-to-noise 
ratio (PSNR) of packet switch, and prove that the trade-off 
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between the bandwidth and PSNR of a Clos network is the 
same as that given by noisy channel capacity theorem in 
transmission. In section III, we show that the loss probability 
of the Clos network with deflection routing is similar to 
the exponential error probability of binary symmetric chan- 
nels with random coding. In section IV, we demonstrate the 
correspondence between nonblocking route assignments and 
error-correcting codes. In section V, we address the capacity 
allocation and capacity matrix decomposition issues related to 
the path switching implemented on a Clos network. In section 
VI, the entropy inequalities of smoothness of scheduling 
are derived, and comparisons of scheduling algorithms are 
discussed. Finally, the conclusion and future research are 
summarized in section VII. 

II. Parallel Characteristics of Contention and 
Noise 

The apparent causes of contention and noise are quite dif- 
ferent, but they both limit the performance of communication 
systems. In transmission, the trade-off between bandwidth and 
signal-to-noise ratio is given by Shannon-Hartley's noisy chan- 
nel capacity theorem [19]. In packet switching, the difference 
between offered load and carried load reflects the degree of 
contention. In order to provide a common ground to compare 
contention and noise, a pseudo signal-to-noise ratio is defined 
to represent the carried load of a packet switch. We show that 
the packet-switched Clos network with random routing can be 
mathematically modelled as a noisy channel. 

A. Pseudo Signal-to-Noise Ratio of Packet switch 

Output port contention occurs in a crossbar switch when 
several packets are destined for the same output in a time 
slot. Packets lost in contention will be dropped as shown in 
Fig. [4] Consider an N x N crossbar switch, without any prior 
knowledge of input traffic. We assume that input loading p 
is homogeneous and output address is uniformly distributed, 
then the carried load p' is the probability that an output is 
busy in any given time slot 



P ' = 1 - (1 - 
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(1) 



The difference between offered load and carried load is 
caused by packet dropping due to contention at the outputs. 
Benes considered the number of calls in progress as the energy 
of the connecting network in the thermodynamics theory of 
traffic in a telephone system [22]. A similar traffic model was 
also explored in the broadband network [23]. If we regard 
packets as energy quantum, this proposition can be adopted in 
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packet switching and stated as follows. 

Proposition 1: (Benes): The signal power S p of an N x N 
crossbar switch is the number of packets carried by the system, 
and the noise power equals to N p = N — S p . Furthermore, 
both the signal power and noise power are Gaussian random 
variables when N is large. 

This proposition is verified by the Boltzmann model of 
packet distribution given in Appendix I [24]. It follows that 
the carried load of a crossbar switch can be converted to 
pseudo signal-to-noise ratio, a concept that is comparable to 
the signal-to-noise ratio (SNR) in transmission. 

Definition 1: The pseudo signal-to-noise ratio (PSNR) of 
a crossbar switch is the ratio of mean signal power to mean 
noise power 

E[S P ] _ Np' _ p' 



PSNR = 
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It should be noted that the ratio of signal power to noise 
power SNR in transmission is defined by the ratio of the 
second moment of signal to that of noise, but PSNR is the 
ratio of their mean power here. Next, we will investigate the 
trade-off between bandwidth and PSNR of the Clos network. 

B. Clos Network with Random Routing as A Noisy Channel 

In a three-stage Clos network, each pair of adjacent switch 
modules are interconnected by a unique link, and each module 
is a crossbar switch. As shown in Fig. [5] the N x N Clos 
network with k input/output modules and m central modules 
is denoted by C(m,n, k) [5], [8], where n is the number of 
input ports or output ports of each input module or output 
module, where N — kn. The dimensions of the modules in 
the input, middle and output stages are nxm, kxk and m x n, 
respectively. The Clos network C(m, n, k) has the following 
characteristics: 

1) Any central module can only assign to one input of each 
input module, and one output of each output module. 

2) Source address S and destination address D can be 
connected through any central module. 

3) The number of alternative paths between input S and 
output D is equal to the number of central modules m. 



Input nxm 

- 








m-l n-l 






















k-l 

m-l n- 





Output 




nQ D +R D 
nQ D +n — \ 

n(k-l) 
rat-1 



Middle stage 



Output stage 



Fig. 4. The carried load of an N X N crossbar switch 



Fig. 5. The Clos network C(m, n, k) 
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In circuit switching theory, it is known that the Clos network 
C(m, n, k) is rearrangeably nonblocking if m > n and strictly 
nonblocking if m > 2n — 1 [8], [25]. Consider the number 
of cental modules m as the bandwidth, the trade-off between 
bandwidth m and PSNR of C(m,n,k) with nonblocking 
routing is given by 



E[S r ] 
E[NJ 



E[S P _ 
E\N„ 



(3) 



In a packet switched Clos network, the maximal data rate of 
each input module is n packets per time slot. Thus we define: 

Definition 2: The ratio a = n/m is the maximal utilization 
of a Clos network C(m, n, k). 

The route of each packet can be expressed by the numbering 
scheme of the Clos network [26]. The switch modules in each 
stage of the network, as well as the links associated with 
each module, are independently labelled from top to bottom. 
According to this numbering scheme, the source address S is 
represented by the 2-tuple S(Qs, Rs)< indicating the source S 
is the link Rs of the input module Qs, where Rs — [S]„ and 
Qs = \_S/n\ are the remainder and quotient of S divided by n. 
Similarly, the destination address D can be represented by the 
2-tuple D(Qd, Rd)- The path of an input packet from source 
S to destination D is determined by the choice of central 
module. Suppose the path goes through the central module G, 
then the routing tag is (G, Qd, Rd) and the path is expressed 
by: 



S(Qs,Rs)^Qi 



qQl 



Qd D(Q D ,R D ) 



In a packet switched Clos network C(m, n, k), contentions 
may occur in the middle stage even if destination addresses of 
input packets are all different. With random routing scheme, 
the trade-off between bandwidth m and PSNR is given as 
follows: 

Theorem 1: The maximal data rate of each input module 
of the packet switched Clos network C(m, n, k) with random 
routing is given by 



to ln(l 



E[S P ] 
E[N P _ 
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(4) 



Proof: 

The maximal data rate is achieved when the input loading 
p = 1 and destination addresses of input packets are all 
different in every time slot, in which case input modules and 
output modules are contention-free. If the central module is 
selected randomly for each input packet, then the loading on 
each input link of a central module is given by 

a = n/m 

Since each switch module of the Clos network is a crossbar 
switch, the carried load on each output link of a central module 

is 

n l m \k fc->+po_ i _ p -n/m 
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Fig. 6. Trade-off between bandwidth m and PSNR of C(m, n, k) 

Substituting the following PSNR for the carried load a' in (01 

E[S P ] kma' a' 



E[N p ] km — kma' 



1 - & 



and taking logarithm, we obtain @. □ 

The trade-offs given in (f3]) and (0]l for constant data rate n 
packets per time slot of each input/output module are plotted 
in Fig. [6] which shows the improvement of PSNR that can be 
achieved by contention-free routing. The same kind of trade- 
off between bandwidth and SNR in the transmission channel 
is stated in the Shannon-Hartley theorem [19] on noisy cannel 
capacity as follows: 

Noisy Channel Capacity Theorem The channel capacity of 
a bandlimited Gaussian channel in the presence of additive 
Gaussian noise is given by 

C = W\og{l S 



N 



(6) 



(5) 



where C is the capacity in bits per second, B is the bandwidth 
of the channel in Hertz, and Mr is the signal-to-noise ratio. 

With the understanding that the contention is mathemati- 
cally analogous to noise, the rest of paper seeks to investi- 
gate the connections between the various routing schemes of 
the Clos network and the corresponding coding schemes of 
the transmission channel. We first demonstrate the similarity 
between deflection routing and random coding in the next 
section. 



III. Clos network with deflection routing 

Packet contention is inevitable in packet switching. One way 
of solving this problem without having to buffer the losing 
packet is to use deflection routing. In contrast to "store and for- 
ward" routing where the network allows buffering, deflection 
routing, also known as hot potato routing, is a routing scheme 
without buffering and can only be implemented for packet- 
switched networks [27]. In case the individual communication 
links cannot support more than one packet at a time, excessive 
packets will be transferred to other available links. Deflection 
routing actually utilizes idle or redundant links of the network 
as temporary storages. Redundancy is built into the switch 
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design so that deflected packets can use extra stages to correct 
the deviated routes. 

A. Cascaded Clos Network 

A cascaded Clos network is constructed by a sequence 
of alternate C(n,n,k) and C(k,k,n), as illustrated in Fig. 
[7] Each output link of a switch module is connected to a 
module in the next stage and an output concentrator (not 
shown in Fig. |7j. A packet will be sent to the concentrator if its 
destination address matches the output numbering, otherwise it 
will continue with the remaining journey. The loss probability 
can be made arbitrarily small by providing a large enough 
number of stages. 

In the cascaded Clos network, the destination address D 
of a packet can either be expressed by D = nQi + R\ in 
the C(n, n, k) network, or D = kQ2 + R2 in the C{k, k, n) 
network, as shown in the example given in Fig. [7] and therefore 
the routing tag in the packet header comprises a 4-tuple 
(Qi, Ri, Q21 -^2). A packet can reach its destination D in 
any two consecutively successful steps, either (Qi,i?i) or 
(Q2,R2)- The Markov chain shown in Fig. [8] describes the 
complete journey of a packet, where pi and q± = 1 — p± are 
respective probabilities of success and deflection in C(n, n, k) 
and C(k, k, n). 

B. Analysis of Deflection Clos Network 

For the sake of simplicity, we assume that the cascaded Clos 
network is formed by C(n, n, n) network uniformly. Since we 
will consider the worst case deflection probability when both n 
and k are sufficiently large, the assumption that n = k does not 
lose the generality of analysis, and the packet loss probability 
will be a conservative estimate. Under this presumption, a 
simplified Markov chain is depicted in Fig. [9] 

The probability of success p in an n x n switch module is 
a function of input loading p. Under the homogeneous traffic 
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Fig. 7. Cascaded Clos network with deflection routing 
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Fig. 8. The Markov chain of deflection routing 
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Fig. 9. The simplified Markov chain of deflection routing 



assumption, we have 

1 — pp = (1 ) ► e p . 

n 

Thus, the worst probability of success is given by 

1 -&-P 
P = 



The probabilities of success p and deflection q = 1 — p versus 
the loading p are plotted in Fig. [TUta). Let G,(fe) be the 
probability that the packet in state i will reach the output state 
O in exactly k steps. The following equations can be derived 
from the Markov chain shown in Fig(9] 



Go(k) 



if k = 
otherwise. 



G R (k) = P G (k - 1) + qG Q (k - 1), k = 1, 2, . 



G Q {k)=pG R (k-l) + qG Q {k-l), k = 1,2,... (7) 
The generating function Gq(/c) is given by 



G, 



) (z)=J2G Q (k)z k = T 



2 2 
p z z 



fe=0 



qz — pqz^ 



(8) 



from which we obtain the following steady state probabilities 

k = 0,1 

G Q (fc) = <J ^- q (pq) {k+1)/2 cosh(k-l)9 fc = 2,4,6---- 
Ji( TO )(fe+i)/2 sinh ( fc _ i)6» k = 3, 5, 7 • • • • 

(9) 



where „ = ^±^2 and 9 = In q+y /$£™ ■ For given 
network length L, a conservative estimate of packet loss 
probability is given as follows. 



Ploss< G «( fc )' 
k=L+l 

which can be explicitly expressed 

P < \ ^( M ) (i+2)/2 sinh(i + 2)c? 
1 ^(M) (i+2)/2 cosh(L + 2)c) 



(10) 



for even length L 
for odd length L. 
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(a) Deflection probability (b) Parameters in equation (7) 
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Fig. 10. Loss probabilities of deflection routing 

When L is large, the logarithm of Pi oss is linear in L as shown 
by the curves in FigQjJc) 

InPioss <m(L + 2) + b, (11) 

where m = ln( 9+v /q ' +ipq ) and b = In(— J ). In the 

worst case when p = 1, p = 0.6321, q = 0.3679, m = 
-0.3566, and b = 0.9683. 

Theorem 2: If the offered load p < 1 on each input of the 
Clos network with deflection routing, then the loss probability 
can be arbitrarily small and the carried load p' on each output 
can be arbitrarily close to the carried load p. 

Proof: It follows from (fTTT i that the loss probability in 
terms of offered load p and carried load p 1 is bounded by 

Pioss = P — JL < ca- L . (12) 
P 

The two parameters a and c are given by following functions 
of deflection probability q 

2 

a = > 1, 

q + V i 2 + 4 P<7 

and 

c= (q + Vq 2 + ^pq) 2 1 

Aq^Jq 2 + Apq 

The parameters a and c versus the offered load p are plotted in 
Fig. [TUto), in which a = 1.4285 and c = 1.2906 when p = I. 

□ 



Conversely, if the offered load p > 1, then the loss 
probability is unbounded, because the number of packets input 
to a switch module could be more than the total number of 
outputs and there is not enough space to deflect packets, which 
can be lost halfway through. 

Compare the deflection Clos network with the binary sym- 
metric channel (BSC) with random coding, as illustrated in 
Fig. QT| Table I provides the parallels between the above 
theorem and Shannon's main theorem [1], [2], [6], [7] stated 
as follows. 

Noisy Channel Coding Theorem Given a noisy channel 
with capacity C and information transmitted at a rate R, 
then if R < C, there exists a coding technique which allows 
the probability of error at the receiver to be made arbitrarily 
small. This means it is possible to transmit information without 
error up to a limit C. However, if R > C, the probability of 
error at the receiver increases without bound. 

The capacity of BSC with cross probability q = 1 — p is 
given by 

C = l+plogp+(l-p)log(l-p) (13) 




Fig. 11. Binary Symmetric Channel (BSC) 

The noisy channel coding theorem states that there exists 
an encoding function E : {0,l} fc — > {0,1}™ and a decoding 
function D : {0, 1}" -> {0, l} fe , such that the error probability 
that the receiver gets a wrong message is bounded by 

P e < a - n + C - n o>l,c>l (14) 

if the code rate R = | = C—5 for some S > [28]. Thus, the 
error probability P e can be arbitrarily small for a sufficiently 
large n. Conversely, if the rate R = — — C + S for some 
5 > 0, then the error probability is unbounded. 
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Binary Symmetric Channel 


Deflection Probability q < ^ 


Cross Probability q < ^ 


Deflection Routing 


Random Coding 


P< 1 


R<C 


Exponential Loss Probability 

Pl oss < ca~ L 


Exponential Error Probability 

P e < a~ n + c~ n 


Complexity increases 
with network length L 


Complexity increases 
with code length n 


Equivalent set of outputs 


Typical set decoding 



TABLE I 

Comparison of deflection Clos network and Binary Symmetric 
Channel 

The similar behavior of the deflection routing of the Clos 
network and the random coding of noisy channel suggests the 
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connection between nonblocking route assignments and error- 
correcting codes. This analogy can be extended to multistage 
interconnection networks as well. The deflection routing of 
the tandem banyan network proposed in [29] is equivalent to 
the concept of error-detection and retransmission, while the 
deflection routing of the dual-shuffle exchange network is a 
distributed error-correction algorithm [30]. These points will 
be further elaborated on in the next section. 

IV. Route Assignments and Error-Correcting 
Codes 

The Clos network C(m, n, k) is rearrangeable if m > n, 
meaning that it can realize connections of any permutations 
between inputs and outputs with the possibility of rearranging 
the calls in progress. Both Slepian-Duguid's nonblocking 
theorem [8], [25] and M. C. Paull's rearrangeable theorem 
[31] on route assignments of the Clos network are rooted in 
matching theory, which starts with Hall's marriage theorem 
that gives the existence condition of a complete matching in 
a given bipartite graph [32]. In the error-correcting code, the 
low density parity check (LDPC) code can also be represented 
by a bipartite graph [11], [12]. The connection between route 
assignment and graphic code in the context of bipartite graph 
is the main point addressed in this section. 

A. Complete Matching in Bipartite Graphs 

A bipartite graph G — (Vl,Vr, E) consists of two finite 
sets Vl, Vr of vertices, and a collection of edges E connecting 
vertices in Vl to vertices in Vr. 

Definition 3: A complete matching in a bipartite graph G 
is an injective function / : Vl — > Vr so that for every x G Vl, 
there is an edge in E whose endpoints are x and f(x). 

Definition 4: The neighborhood of any subset A C Vl is 
defined by 

N A = {b\{a,b) e E,ae A} C V R 

which are endpoints of an edge in E whose other endpoints 
lie in A. 

The necessary and sufficient condition for a bipartite graph 
to have a complete matching is given by Hall's marriage 
theorem [32], [33]. Because it is the most fundamental theorem 
in switching theory, we state the theorem and give R. Rado's 
elegant proof [34] below. 

Theorem 3: (Hall) Let G = (V L ,V R ,E) be a bipartite 
graph, there exists a complete matching for G if and only 
if 

\N A \ > \A\ (15) 

for any subset A C Vl- 

Proof: It is clear that the condition is essential for the 
existence of a complete matching, it remains to show that it is 
sufficient. Let V L = {1,2, ...,n}, and F = {Ni, N 2 , N n }, 
by way of contradiction, suppose Ni contains x, y, the removal 
of either x or y violates Hall's condition. There exists A,BC 
{2, ...,n} with the property 

R A = N A U(N 1 -{x}), \Ra\<\A\ 
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Fig. 12. An edge colored regular bipartite graph and decomposition of integer 
doubly stochastic matrix 



Ri 



N B U (Nt ~ {y}), 



\Rb\ < \B\ 



Then \R A UR B \ = U JVi|, and \R A nR B \ > \N A nB\ 

It follows that 



\A\ + \B\ > \R A \ + \Rb\ = \RaUR b \ + \Ra n Rb\ 

>\N AuB UN x \ + \N AnB \. (16) 

On the other hand, from Hall's condition, we have 

WAuB^ml + lNAnB] > \A(JB\ + l + \AnB\ = \A\ + \B\ + 1. 

(17) 

Combining ( fTSI l and ( fl7| i together will lead to 

\A\ + \B\ > \A\ + |B| + 1, 

a contradiction. Hence, the removal of either x or y does 
not violate Hall's condition, and a complete matching can be 
obtained by repeating this procedure until each Ni contains 
only one element. □ 

The nonblocking route assignment of the Clos network and 
the edge coloring of the bipartite graph stated in the follow- 
ing theorem are equivalent consequences of Hall's marriage 
theorem. 

Theorem 4: The following statements are equivalent: 

1. A regular bipartite graph G with degree n can be edge 
colored by m colors if m > n. 

2. (Slepian-Duguid) The Clos network C(m, n, k) is rear- 
rangeably nonblocking if m > n. 

Proof: 1. For any subset A C Vl, the edges terminate on 
vertices in A must be terminated on vertices in its neighbor- 
hood set N A at the other end, Hall's condition holds because 

n\N A \ >n\A\ => \N A \ > \A\. 

The bipartite graph G can be reduced to n complete 
matching by repeatedly applying Hall's theorem. It is clear 
that edge coloring can be satisfied if m > n. The edge 
colored bipartite graph is equivalent to the decomposition of 
an integer doubly stochastic matrix into permutation matrices. 
An example to illustrate this point is shown in Figfl2l 

2. Consider a set of call requests 
{(So, Dq)i ■ ■ ■ , (Sn-i, Dn-i)}> m which the destination 
addresses are all different, and suppose the central module 
Gi is assigned to the request (Si, Di) for i = 0, • • • , N — 1. 
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Fig. 13. Correspondence between Clos network and bipartite graph 
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TABLE II 
The set of routing tags (G, Q, R) 



This set of assignments is nonblocking, or contention-free, if 
and only if 



and 



[Si/n\ = [Sj/nJ =► d ? Gj 



[Pi/n\ = [Dj/nl Gj 



for all i ^ j. It simply means that if either of the two 
sources Si, Sj are on the same input modules, or the two 
destinations Di,Dj are on the same output module, then 
the central modules assigned to (Si, Di), (Sj, Dj) must be 
different. 

The route assignments can be formulated as the edge color- 
ing of a bipartite graph G(Vl, Vr, E), in which vertices of Vl 
represent input modules and vertices of Vr represent output 
modules, and each connection request (Si, Di) is represented 
by an edge ([Si/n\, \Di/n\) in E. This bipartite graph is 
regular with degree n, and can be colored by to colors if 
to > n, such that each color represents a central module and 
the edge coloring is the set of route assignments. □ 

An example to show the equivalence between route assign- 
ment of the Clos network and edge coloring of the bipartite 
graph is given in FigfO] The resulting route assignments of 
the call request are listed in Table [TTJ The connection between 
route assignment and error-correcting code will be addressed 
in the sequel. 

B. Graphical Codes 

The low-density parity-check (LDPC) code is a class of 
linear block code that can be represented by bipartite graphs 
G(Vl, Vr, E), called Tanner graph [12], in which Vl is the 
set of variable-vertices and Vr the set of constraint-vertices. 
The parity matrix specifies the edge set E. Each variable can 
assume the value of or 1, and a constraint is satisfied if the 
sum of all the variables adjacent to it is zero mod 2. Fig. [T4l 




Unsatisfied 

x + x, + x, + x s = 
Satisfied 



x 2 + x 5 + x 6 + x, = 
Satisfied 

x +x,+x 4 +x 6 =l 
Unsatisfied 

Closed Under (+) 2 



Fig. 14. Low density parity check code 



illustrates an example of such a bipartite graph that represents 
the following matrix 



10 110 1 
1110 10 
1 1 1 1 
10 110 10 



(18) 



A vector x € {0, 1}" is a code word if and only if x satisfies 
all constraints. The set of all code words is closed with respect 
to mod 2 sum, and forms a linear subspace of {0, 1}™. 

A subclass of graphical codes based on expander graphs was 
constructed by Sipser and Speilman [13]. The expander code 
has the following linear time sequential decoding algorithm. 

Sipser-Speilman decoding algorithm: 

If there is a variable vertex v such that most of its neighboring 
constraints are unsatisfied, flip the value of v. Repeat. 

The decoding algorithm of expander codes guarantees that 
the number of unsatisfied vertices will be monotonically 
decreasing until all constraints are satisfied. The main result 
is stated in the following theorem. 

Theorem 5: (Sipser-Speilman) Let G(Vl,Vr,E) be the 
bipartite graph of size \Vl\ = n, \ Vr\ = to that is k— regular 
on the left. Assume that for any subset A C Vl of size 
\A\ < an, \Na\ > ^\A\, then the decoding algorithm will 
correct up to errors. 

The similarity between Hall's condition ( TT~5b on complete 
matching and the condition on expander graphs in the above 
theorem is quite obvious. The resemblance between these 
two structure conditions on bipartite graphs suggests the con- 
nection between error-correcting code and route assignment 
of Clos network. The Sipser-Speilman decoding algorithm 
is extended to the nonblocking route assignment of Benes 
network to illustrate their correspondence. 

C. Route Assignments of Bense Network 

A Bense network is a multi-stage connecting network that 
can be recursively constructed from Clos network C(2,2,k) 
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[0 unsatisfied 
Module 2 num = < 

[1 satisfied 

Fig. 15. Nonblocking constraints of Benes network 



[8], [25]. A set of call requests is a mapping from inputs to 
outputs. An example is given in ( fT9] l, which displays a set of 
connection requests in an 8 x 8 Benes network. The upper row 
of the matrix is the ordered inputs from to 7, and the lower 
row consists of the respective outputs. 

,= (° 1 2 3 4 5 6 M (19) 
^16057243/ J 

The two central modules in C(2,2, y), as shown in Fig. 
U2Ja), are labelled respectively by and 1. Let x$ be the 
binary variable associated with the request (i,7r(i)), for i = 
0, • ■ ■ , N — 1. The value of Xi £ {0, 1} defines the following 
route assignment 

{0 if module assigned to (i, 7r(i)) 
1 otherwise 

(20) 

Since the same central module cannot be assigned to two 
inputs or two outputs on the same module, the necessary and 
sufficient condition on nonblocking route assignment can be 
simply formulated as a set of linear constraints 

Xi + Xj = 1 (21) 

for all i 7^ j such that either two inputs i and j are on the 
same input module, or two outputs 7r(z)) and are on the 
same output module. 

As shown in FigQ3] let Vl = {xq,--- , xjv-i} be the 
set of variables and Vr be the set of constraints d2"Tt . a 
bipartite graph G(Vl, Vr, E) similar to that of LDPC can be 
constructed to solve for the values of x^s, which determine 
the nonblocking routes of the set of call requests in it. It 
should be noted that the set of solutions to (f2T~b is not closed 




Input module ~ Output module 

constraints Variables constraints 



Fig. 16. The flip algorithm for nonblocking route assignments 




Fig. 17. The connection requests and and route assignments 



with respect to mod 2 sum. In fact, two solutions in each 
connected component of the bipartite graph G(Vl, Vr, E) are 
complementary to each other, and the total number of solutions 
of ( f2Tb should be 2 9 , where g is the number of connected 
components of G. The following route assignment algorithm 
is a modification of Sipser-Speilman decoding algorithm. 



The flip algorithm for nonblocking route assignments: 

Step 1 Initially assign xq = 0,X\ — 1,X2 — 0, x 3 — 1, • ■ • 

to satisfy all input module constraints. 
Step 2 In each cycle of the bipartite graph, unsatisfied 

vertices divide the cycle into line segments. Label 

them a and f3 alternately and flip the values of all 

variables located in a segments. 



The initial values of all variables can be arbitrarily assigned. 
Compared with the Sipser-Speilman decoding algorithm, the 
initial assignment can be considered as the received signal, 
and errors are systematically removed by the flip algorithm to 
satisfy all constraints. 

Theorem 6: All constraints are satisfied when the flip al- 
gorithm terminates. 

Proof: The proof of this theorem is illustrated by the 
example displayed in Fig. [T6] First, we want to show that 
every cycle has an even number of unsatisfied vertices. It is 
clear that the total number of constraints is even because there 
is an equal number of input modules and output modules in 
each cycle. Assume by way of contradiction that the number 
of unsatisfied vertices is odd, then the number of satisfied 
vertices must also be odd. Let u and ui be the number of 
unsatisfied vertices whose neighboring variables both equal 
and 1, respectively, and let s be the number of satisfied 
vertices. Then, the number of variables equal to and 1, 
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Times slot 



Fig. 18. The complete route assignments of Benes network 



respectively, should be Mo+§ ar, d M i + f > which are impossible 
if s is odd. 

Next, we want to show that all constraints are satisfied 
when the flip algorithm terminates as shown in FigJT6j For 
an arbitrary constraint vertex v E Vr, let Xi , Xj € Vl be 
the two neighboring variables of v. If v is unsatisfied, the two 
variables x^ and Xj should have the same value but are located 
in segments bearing different labels. Only one located in the a 
segment will flip the value, and the vertex v should be satisfied 
when the algorithm terminates. 

On the other hand, if the constraint v is initially satisfied, 
then Xi and Xj flip simultaneously if they are both in an a 
segment, or else they keep the same value if they are located 
in a j3 segment. Hence, the constraint v remains satisfied in 
either case when the algorithm terminates. □ 

The route assignments of the set of call requests ( [T9l l 
resulting from the flip algorithm are displayed in FigfT7l and 
the complete assignments shown in Fig. [18] can be determined 
iteratively. If both N and n are of the power of 2, the 
set of equations (f2TT> can also be solved by the parallel 
algorithm proposed in [35] with time complexity on the order 
of O(log 2 (A0). 




Fig. 19. Capacity of virtual path as average number of edges 
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Fig. 20. Noiseless virtual path of Clos network 



V. Clos Network as Noiseless Channel-Path 
Switching 

Path switching proposed in [14] is a compromise of static 
routing and dynamic routing schemes for a Clos network. A set 
of connection patterns of the central modules is predetermined 
according to the traffic between pairs of input and output 
modules. This set of connection patterns is repeatedly used in 
a cyclic manner such that online path hunting can be avoided 
and bandwidth requirements can be satisfied in the long run. 

The scheduling of path switching is based on the following 
relationships between the edge colored bipartite graph and the 
connection pattern in the middle stage of the Clos network as 
depicted in Fig. [T3l earlier: 

1) The number of edges ey represents the number of 
packets that can be sent from input module Ii to output 
module Oj, the virtual path Vy, in one time slot, 

2) Each color of the bipartite graph corresponds to a 
central module of the Clos network. 

This correspondence suggests that the scheduling of switch ac- 
cording to predetermined connection patterns if traffic matrix 
is expressed as weighted sum of a finite number of permutation 
matrices as shown in Fig. [19] 



For any given traffic matrix T = [Ay], where Ay is the 
number of packets per time slot from input module Ii to output 
module Oj, such that J^i Ay < n < m, and ^\ Ay < n < m, 
if there exists a finite number F of regular bipartite graphs 
such that the capacity cy of the virtual path Vy between Ii 
and Oi satisfies 



F 



> K 



(22) 



where ey (t) is the number of edges from node i to node j in 
the t bipartite graph. The bandwidth requirement T = [Ay] 
can be satisfied if the system periodically provides connections 
according to the edge coloring of these F bipartite graphs. 
Adopting convention in the TDMA system, each cycle is called 
a frame and the period F frame size. The routing information 
can be stored in the local memory of each input module to 
avoid the slot-by-slot computation of route assignments [36], 
[37]. 

Since the connection patterns of switch modules in the mid- 
dle stage are fixed, the virtual path between any pair of input 
module and output module is contention-free. As shown in 
Fig. [20] input/output modules are scaled to a much smaller size 
and routing decisions in these modules become independent. 
Each input module is an independent input-queued switch, 
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arrival packets can be scheduled in the input buffer according 
to predetermined routes in every time slot by some matching 
algorithm similar to the scheduling algorithm for input-queued 
crossbar switch [38], while each output module could be 
an output-queued switch similar to a knockout switch [39]. 
Consider the scheduled Clos network as a noiseless channel, 
the scheduling of path switching maps central modules and 
time slots into incoming packets, a process similar to the 
source coding of transmission [1], [2], [6], [7] if the set 
of predetermined connection patterns is regarded as a code 
book. In this section, we address the capacity allocation and 
traffic matrix decomposition issues, and the smoothness of 
scheduling will be discussed in the next section. 



Capacity Allocation Algorithm 



Step 1 Initialization: Set k = 1 and c,y = Ay. 
Step 2 Calculate slack capacity of each input module (row) 
and output module (column). 



-4 



(fe-i) 



En 



„(fc-l) 



1 °« 



D 



(fc-i) 



V^n J fc -1) 

Z-<i=i °y 

,(fc-i) 



Step 3 Remove any saturated row i if Af L > = 0, and 
column j if X ' = 0. The algorithm terminates 
if there is no slack capacity left, otherwise do 



A. Capacity Allocation 

The capacity allocation problem seeks to find the capacity 
cy > Ay for each virtual path Vy between Li and O 



3 

m 



such that non-overbooking condition J2i c ij = S j c ij 
for each input/output module is observed. The problem can 
be formulated as constrained optimization of some objective 
function [40]. The choice of the objective function depends on 
the stochastic characteristic of the traffic on virtual paths and 
the quality of service requirements of connections. Following 
Kleinrock's independency assumption [41], [42], each virtual 
path can be modelled as an M/M/l queue with arrival rate 
Ay and service rate c^ for all i, j, then the average delay for 
the packets from input module Ii to output module Oj is given 
by 

Da = — (23) 



L !j Ay 

and the objective is to minimize the total weighted delay [41] 

A ' ' (24) 



£ 



A; 



subject to cy > Ay and £\ cy = Xy c ij — m - The 
computation of the optimal allocation is quite involved in gen- 
eral. Nevertheless, the following heuristic algorithm based on 
Kleinrock's square root rule [41] can always yield suboptimal 
capacity allocation that is close to the optimal solution. 

Theorem 7: For any given traffic matrix T — [Ay] with 
non-zero entries, Ay > for all i and j, then 



and 



n 

£ A *i 



<£ C , 



<£ c > 



m 



(25) 



(26) 



when the capacity allocation algorithm terminates. 
Proof: Since Ay > 0, it is obvious that 



J*0 > Jfc-i) 



(27) 



in k iteration of the algorithm for all k. Also, we have 



A 



(fe-i) 



j'=i 



=i 



to. 



(28) 



min(- 



A 



(fe-i) 



Ai, B 



(k-i) 



and set k = k 



En rr — ' ^r^n 

1, goto step 2. 



The combination of (|27T i and ( |28] | assures that Y^j=x Ci 

Cj 



jj=X u v — 

m, and similarly, that Y^i=i °ij = TO ' when the algorithm 
terminates. □ 



Notice that if some entries Ay are zero, then the strict 
inequality (|27T > does not hold in general, and there is no 
guarantee that the algorithm will halt. It is also true for any 
capacity allocation algorithms that if Ay = implies cy = 0, 
then the doubly stochastic matrix C = [cy] that satisfies (|25| > 
and d26i > may not exist. Nevertheless, this problem can be 
eliminated in practice if a small amount of capacity cy = e 
allocated to a virtual path with Ay = is allowed. 

B. Capacity Matrix Decomposition 

The time axis is divided into frames of time-slots, and 
the frame size is denoted by F. Within each frame, the rate 
requirement will be satisfied by a set of connection patterns, 
which are predetermined by the decomposition of the capacity 
matrix [cy]. In a time-slotted system, it is reasonable to 
assume that all entries cy are rational numbers with a least 
common denominator F. Then [F ■ c^] is an integer doubly 
stochastic matrix with constant mF row sum and column sum. 
According to theorem @] the bipartite graph corresponding 
to the matrix [F ■ cy ] can be colored by mF colors. The 
colored bipartite graph can also be expressed by the genralized 
Birkhoff-von Neumann decomposition [43]-[45] stated in the 
following theorem. 

Theorem 8: (Generalized Birkhoff-von Neumann Decom- 
position) The capacity matrix C has the following expansions 

^ mF F K 

C = p E M * = f £ Gl = £ ^Pi, (29) 

2—1 i — 1 2—1 



such that d = Y,™ m (i-i)+l M 3> i = 1 • ' " • F and 



With respect to the scheduling of path switching, the cor- 
respondence between connection patterns in the Clos network 
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1.4 0.5 1.1 
0.6 1.8 0.6 
1.0 0.7 1.3 

capacity matrix 
(per slot) 



5 2 4 
2 7 2 
4 2 5 
R.M.S. error = 0.104 



F = 8 



114 8 
4 14 4 
8 5 10 

R.M.S. error = 0.0608 



Fig. 21. Round-off errors with different frame size F. 



G(m, n, k) and these matrices in the series expansions d29l > is 
given as follows: 

1) Each Mi is a permutation matrix, or complete matching. 

2) Each matrix Gj is a sum of m permutation matrices, 
Af m ( i _ 1 ) +1 , ■ • • , M mi , which represent connection pat- 
terns of those m central modules of G(m, n, k) in time 
slot i of every frame. That is, the matrix G; is an 
edge colored regular bipartite graph with degree m. 
The combination of permutation matrices in d can be 
arbitrary. 

3) Each matrix P; is a state of the switch such that Pi ^ Pj 
for all i j, and {Pi, • • • , P K } = {G x , • • • , G F }. The 
coefficient fa is the frequency of the state Pj within each 
time frame. Since the total number of constraints in the 
doubly stochastic matrix G equals to (N — l) 2 + 1, the 
number of states is bounded by K < min{P, N 2 — 2N+ 
2}- 

The scheduling information stored in the memory is linearly 
proportional to F, the frame size is limited by the access 
speed and the memory space of input modules. In the capacity 
matrix decomposition d29l) . both K and F could be too 
large in practice, because the number of states K is in the 
order of 0(N 2 ) and the frame size F is determined by the 
least common denominator of cy. The following limitations 
compromise between complexity and QoS: 

1) If the capacity matrix G is bandlimited such that Cy < 
y, then it is easy to show that K < F < 

2) The frame size F can be a constant independent of 
switch size N if [F ■ Cij] is rounded off into an integer 
matrix, and the round-off error in the order of 0(1/P) is 
acceptable. The error can be arbitrarily small if the frame 
size F is sufficiently large, as shown in the examples 
given in Fig. [21] 

Consider the traffic matrix as the aggregate of signals 
input to the packet switch, the series expansion of doubly 
stochastic capacity matrix into permutation matrices (1291 . 
and the reconstruction of capacity by weighted running sum 
of scheduled connections are mathematically similar to that 
of the Nyquist-Shannon sampling theorem of bandlimited 
signals [18]— [21] input to the transmission channel. 

Sampling theorem of bandlimited signal If a function f{t) 
contains no frequencies higher than W cps, it is completely 



determined by its ordinates at a series of equally spaced 
sampling intervals ofl/2W sec, called Nyquist intervals. If 
\F(uj)\ = 0,for |w| > 2ttW, then 



+ 00 



/(*) = £/» 



simr(2Wt — n) 
n(2Wt - n) 



where f n = /(jpj) is the n th sample of f(t). 

An important common characteristic of both series expan- 
sions is the complexity reduction of communication systems. 
Without scheduling, the total number of possible permutations 
of a packet switch for a given capacity matrix is Nl. The 
complexity reduction is achieved by limiting the space to 
those permutations only involved in the expansion (|29l . Hence, 
the dimension of permutation space is reduced to 0(N 2 ) by 
the traffic matrix decomposition. It can further be reduced to 
O(N) if each connection is bandwidth limited. The complexity 
of packet level problems can be significantly reduced if the 
switch fabric is limited to this set of predetermined permu- 
tations only, operating in a much smaller subspace, yet the 
capacity of each connection is still guaranteed. In general, the 
minimal number of permutations required is determined by the 
edge coloring of the corresponding bipartite graphs as stated 
in theorem [4] 

In digital transmission, the sampling theorem reduces the 
infinite dimensional signal space in any time interval T to a 
finite number 2TW of samples, and the minimal number of 
samples is determined by the Nyquist sampling rate for limited 
bandwidth W of the input signal function. The comparisons of 
the these two expansions and their respective roles in packet 
switching and digital transmission are listed in Table [HI] 

VI. Scheduling and Source Coding 

The process and function of a scheduling algorithm are 
similar to those of source coding in transmission. If we regard 
the set of predetermined connection patterns as a code book, 
scheduling incoming packets in the input buffer according to 
predetermined connection patterns is the same as encoding the 
source signal. This point will be elaborated by the construction 
of a scheduling algorithm based on the Huffman tree. We first 
show that the smoothness of scheduling, like source coding, 
is bounded by the source entropy, or the entropy of capacity 
decomposition. The results can then be generalized to the 
smoothness of two-dimensional scheduling of tokens assigned 
to each virtual path of the Clos network. 

A. Smoothness of Scheduling 

In light of the fairness of services, the scheduling is expected 
to be as smooth as possible. In a Clos network with path 
switching, in addition to guarantee capacity for each virtual 
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Fig. 22. Interstate time for t state within a frame of size F 
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Packet switching 


Digital transmission 


Network environment 


Time-slotted switching system 


Time-slotted transmission system 


Bandwidth limitation 


Capacity limited traffic matrix 

JV JV 

i=l i=l 


Bandwidth limited signal function 
f(t) = — / F{w)du 




JV N 

X i3 < c ij = m 
j=l i=l 


F(u) = 0, M > 2nw 


Samples 


(0, 1) Permutation matrices 


(0, 1) Binary sequences 


Expansion 


BvN decomposition (Hall's marriage theorem) 

Fm K 

*=Ef = 

i=l i=l 
Frame size=i < =lcm or denominators of cy 


Fourier series 
,^ ^ * sirm(2Wt - n) 

Nyquist interval= 1 = 


Inversion 


Reconstruct the capacity by running sum 


Reconstruct the signal by interpolation 


Complexity reduction 


• Reduce number of permutations from iV! to 0(N 2 ) 

• Reduce to O(N) if bandwidth is limited 

• Reduce to constant F if truncation error of order 
0(1/ F) is acceptable 


• Reduce infinite dimensional signal space 
to finite number 2tW in any duration t 


QoS 


Capacity guarantee, Scheduling, Delay bound 


Error-correcting code, Data compression, DSP 



TABLE III 

Comparison of traffic matrix decomposition and sampling theorem 



path, smooth scheduling also reduce delay jitters and alleviate 
head-of-line (HOL) blocking at input buffer [25]. 

Consider a scheduling of a given capacity decomposition 

C = J2& P i with frame size F - Let X l\ X z\- ■ ■ , X n] be 
i=l 

a sequence of interstate times of state Pj within a frame. It is 
easy to see from Fig. [221 that 

n t = (30) 

+ ---+X® =F, for all i = 1, • • • , fc. (31) 

Since delay jitter is one of the major concerns of scheduling, 
it is natural to define the smoothness of scheduling by the 
second moment of interstate time. 



Definition 5: The smoothness of state Pj is defined by 

L 



log' 



(32) 



and the smoothness of a scheduling is the weighted average 
given by 



A' 



L = ^ 4>iLi. 



(33) 



The following theorem reveals the fact that the smoothness 
of scheduling is parallel to the code length of source coding 
in information theory. 

Theorem 9: For any scheduling of a given capacity decom- 
position C = Yh=i 4>iPi w i tn frame size F, we have 

K 



£2" 



< 1 



(Kraft's inequality) 



and the average smoothness is bounded by 

A A 



(34) 



(35) 



-k for all i = 1, • ■ • ,K, 



Both equalities hold when XI 
and k — 1, • • • , m. 

Proof: The inequality d34l ) can be obtained by 
substituting ( f30b and (fJTJ into the Cauchy-Schwartz inequality 



A 



A 



5>^ = £ 



i=l 



=1 V Z J fc=lV vV fe ) i=l i=l 



A A 



Again, using Cauchy-Schwartz inequality, we have 

A 



A 



'E£ =1 (*f) 



A 



Z— 1 2—1 

□ 

Suppose equalities in the above theorem hold, then the 
scheduling is obviously optimal. First, we consider a simple 
example when K = F,<pi = y, and rii = 1 for all i, then 

(t) 

X{ > = F for alH = 1, • • • , F, in which case the smoothness 
equals entropy. 

i F 

L = -^lo gv ^F2) =logF = H (36) 

i=l 

When all weights are reciprocals of the power of 2, for 
example <fri — |, \ , g, g, an optimal scheduling can be easily 
found. An example is shown in Fig 1231 a), where F = 8, K = 
4, and rij = 4, 2, 1, 1 with respect to state P^ for i = 1, • • • ,4. 



Pi p 2 



p, p, 



p 4 



Pi |p 3 

(a) Optimal Scheduling 
Fig. 23. Examples of Scheduling 



P] 


Pi 


p 2 


Pi 


Pi 


p. 


p 3 


P4 



(b) WFQ Scheduling 
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The scheduling with constant interstate time = 2,4,8,8 
satisfies both equalities of theorem 10 

F = L = i-+i-2 + i-3 + i-3 = 1.75. (37) 

Another scheduling based on weighted fair queueing (WFQ) 
for the same decomposition is shown in Fig l23l b). The 
smoothness L — 1.8758 is greater than the decomposition en- 
tropy H = 1.75, and the superiority of the optimal scheduling 
is quite obvious by comparing Fig. 1237 a) and (b). An upper 
bound of smoothness is given in the following theorem. 

Theorem 10: For any capacity decomposition C = 
J2i—i faPh it is always possible to device a scheduling whose 
smoothness L is within i of decomposition entropy 

H<L<H+^ (38) 

Proof: Consider a random scheduling without frame 
structure such that the state Pj of each time slot is randomly 
selected with probability fa. The interstate time X^ of state 
Pi is a geometric random var iable with E[X^} = j-, and 
Var[X^\ = The second moment of X® and the 

smoothness of state Pj are given respectively as follows 

£[(I (, ») 2 ] = Var[X^} + [E[X^]) 2 = ^J^, (39) 

m 

and 

U = log yjE[(XM)*] = \ log (2 - fa) + log fa. (40) 

It is counter intuitive that the smoothness L of random 
scheduling does not equal to the decomposition entropy H; 
in fact we have 

K K 

L = Y,<l>iLi = -^^log(2-^)+if. (41) 

»=i i=i 

It is easy to show that the Kullback-Leibler distance L — H 
reaches the maximum value when fa = fa = ■ ■ ■ = fa% = j^, 
in which case we have 

1 K 1 11 

L-]/ = -^^,log(2- fa) < - log {2--)< -. (42) 

i=l 

The random scheduling is actually no scheduling at all. Hence, 
there should exist a scheduling algorithm, with or without 
frame, that satisfies the bound given in (l38l . □ 

It is obvious that theorem [9] and [10] are counterparts of the 
source coding theorem stated as follows. 

Noiseless coding theorem Let random variable X take 
the possible values x±,--- ,xk with respective probabilities 
p(x\), ■ ■ ■ ,p(xk)- Then, the necessary and sufficient condi- 
tion to encode the values of X in binary prefix code ( none 
of which is an extension of another) of respective lengths 
L\, ■ ■ ■ , Lk is 

K 

^2~ Ll < 1 (Kraft's inequality) (43) 

i=l 



77ie average code length is bounded by 

K K 

L = X>(zi)£i > H(X) = ^pfojlog—r (44) 

i=i i=i P( - Xt) 

and it is always possible to device an optimal prefix code for 
X whose average code length L is within 1 of entropy 

H(X) <L< H(X) + 1. (45) 



B. Comparison of Scheduling Algorithms 

The scheduling algorithm for an arbitrary set of decom- 
position weights fa achieves optimal smoothness if L = H , 
which requires the constant interstate time that equals to the 
inverse of weight Xjj^ — = 4: for all state Pj. Most 
proposed scheduling algorithms are indeed constructed from 
the reciprocal of the normalized weights. A comparison of 
several stereotype algorithms is given below. 

1) Weighted Fair Queueing (WFQ) Scheduling Algorithm: 
The WFQ scheduling algorithm [46] is constructed from the 
sequence of virtual finish times 4r , x' ' " • °f eac h state Pi w i m 
the initial finish time -j-. The algorithm is given as follows. 



The WFQ scheduling algorithm: 

Select the state with the smallest finish time and increase its 
finish time by the inverse of its weight. Repeat this process 
until the frame size is reached 



We will use the running example of a set of five states 
Pi, P2, P3, P4, P5 with respective weights fa — 0.5, fa = 
fa = fai = fa = 0.125 to illustrate the performance of 
those scheduling algorithms presented in this section. The 
sequence generated by WFQ in each time slot r within a 
frame is given in the Table IIV1 and the final sequence is 



T 


Pi 


P2 


Pz 


Pa 


Ps 


Selection 


1 


2 


8 


8 


8 


8 


Pi 


2 


4 


8 


8 


8 


8 


Pi 


3 


6 


8 


8 


8 


8 


Pi 


4 


8 


8 


8 


8 


8 


Pi 


5 


10 


8 


8 


8 


8 


P 2 


6 


10 


16 


8 


8 


8 


P 3 


7 


10 


16 


16 


8 


8 


P 4 


8 


10 


16 


16 


16 


8 


P 5 



TABLE IV 
An Example of WFQ Algorithm 



Although the WFQ algorithm properly utilizes the inverse 
weights in constructing the scheduling, the above example 
shows that the sequence generated by WFQ is still far from 
the achievable optimal smoothness. Possible amendments to 
WFQ are discussed below. 
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2) WF 2 Q Scheduling Algorithm: WF 2 Q is a scheduling 
algorithm [47] that incorporates the stringent rate requirement 
of generalized processor sharing (GPS) [48] in WFQ. Let 
Ti(r) be the number of time slots assigned to state Pi up 
to time t, the WF 2 Q will select the state Pi that satisfies 

Ti(r - 1) < r • (fn (46) 

in every time slot r = 1, 2, • • • . That is, only the states that 
have started its service in the corresponding GPS system will 
be selected. The set of states qualified for selection in time 
slot t is defined by 

Qr = {P | Ti{r - 1) < r • ct>i} (47) 

Hence, all states are qualified in the first time slot (see Table 
IV), because P;(0) — for all i. The set of qualified states 
Qi = {P1P2P3P4P5} is then ordered by their finish time 
and the state P\ is selected. In the second time slot, however, 
Ti(l) = 1 and 2 • <fii = 1, so the state Pi cannot be selected 
and Q 2 = {P2P3P4P5}. In each time slot, whenever a state 
is selected, like in WFQ, its finish time will be increased by 
the inverse of its weight. 

The following table provides the whole procedure of 
WF 2 Q. The final sequence is P 1 P 2 P 1 P 3 P 1 P i P 1 P 5 , which 
is better than the sequence produced by WFQ. Another 
approach to amend WFQ based on the Huffman tree is given 
next. 



T 


Pi 


P 2 


Pz 


Pa 


Ps 


Qr 


Selection 


1 


2 


8 


8 


8 


8 


P1P2P3P4P5 


Pi 


2 


4 


8 


8 


8 


8 


P2P3P4P5 


P 2 


3 


4 


8 


8 


8 


8 


P1P3P4P5 


Pi 


4 


4 


16 


8 


8 


8 


P3P4P5 


P 3 


5 


6 


16 


8 


8 


8 


PiPiPs 


Pi 


6 


6 


16 


16 


8 


8 


PiP 5 


Pi 


7 


8 


16 


16 


8 


8 


PlP5 


Pi 


8 


10 


16 


16 


16 


8 


Ps 


P5 



TABLE V 
An Example of WF 2 Q Algorithm 



3) Huffman Round Rabin (HuRR) Algorithm : It is known 
that the Huffman code [49] is the optimal source code con- 
structed from the binary probability tree, called the Huffman 
tree, in a hierarchical manner. When the WFQ is applied to 
only two states, the average interstate time of each state is very 
close to the optimum because the two-state Huffman binary 
tree only has one level. 

For example, consider the two states P x and P 2 with 
respective weights 4>x = 0.25 and 2 = 0.75, the WFQ will 
produce the sequence P 2 P 2 PiP 2 . The interstate time of state 
Pi is 4, which is exactly the reciprocal of 0.25, and state P 2 
has interstate time of 1 and 2, which are the two integers 
nearest 1/0.75. However, if there are more than two states, 
the WFQ fails to achieve the optimality because the Huffman 
tree has multiple levels. 

The optimality of Huffman code is closely related to Kraft's 
inequality, which is the necessary and sufficient condition of 
prefix source coding. In scheduling, the optimal smoothness 




Fig. 24. A Huffman tree of HuRR Algorithm 

L = H also implies Li = log J- for all state Pi. Hence, the 
equality of Kraft's inequality holds 

K 

■1=1 

Similar to the Huffman code, the HuRR scheduling algorithm 
proposed in [50] is a hierarchical scheduling algorithm syn- 
thesized from the Huffman tree. 

Consider each state of the decomposition as a symbol and 
the weight of the state as its probability, a Huffman tree 
can be constructed from ordered sequence of probabilities as 
usual. The probability of the root node is 1, the probability of 
each leave node is the probability of the symbol represented 
by that leave, and the probability of each intermediate node 
is the sum of the probabilities of two successors. The HuRR 
algorithm comprises the following steps. 



The HuRR scheduling algorithm: 

Step 1 Initially set the root to be temporary node Py, and 

S = Px ■ ■ ■ Px to be a temporary sequence. 
Step 2 Apply the WFQ to the two successors of Px to 

produce a sequence T, and substitute T for the 

subsequence Px • ■ ■ Px of S. 
Step 3 If there is no intermediate node in the sequence S, 

then terminate the algorithm. Otherwise select an 

intermediate node Px appearing in S and goto step 

2. 



The Huffman tree of the previous example is shown in Fig. 
l24l the HuRR scheduling algorithm will generate the following 
sequences 

PiPzPiPzPiPzPiPz -> PiPxPiPyPiPxPiPy 

-> PiP 2 PiP 4 PiP 3 PiP 5 

In this particular example, the following Huffman code can be 
generated from the tree depicted in Fig. (t24b 

Pi < — 0, P 2 < — 100, P 3 < — 101, Pi < — 110, P 5 < — 111 



16 



and the logarithm of interstate time of each state is equal 
to the length of its Huffman code. The smoothness of the 
sequence of this example is the same as that generated by 
WF 2 Q earlier. However, the HuRR outperforms WF 2 Q in 
general as demonstrated by the comparison given next. 

4) Comparison of Smoothness of Scheduling Algorithms: 
The smoothness of preceding scheduling algorithms are com- 
pared with random scheduling in Table [VI] to show their pro- 
gressive improvements. It is clear that the sequences generated 
by HuRR are, in general, closer to the entropy than the other 
two. The performance of WF 2 Q is comparable to HuRR, 
while WFQ is not satisfactory in some cases. 



Pi 


P 2 


Ps 


Pa 


Random 


WFQ 


WF A Q 


HuRR 


Entropy 


0.1 


0.1 


0.1 


0.7 


1.628 


1.575 


1.414 


1.414 


1.357 


0.1 


0.1 


0.2 


0.6 


1.894 


1.734 


1.626 


1.604 


1.571 


0.1 


0.1 


0.3 


0.5 


2.040 


1.784 


1.724 


1.702 


1.686 


0.1 


0.2 


0.2 


0.5 


2.123 


1.882 


1.801 


1.772 


1.761 


0.1 


0.1 


0.4 


0.4 


2.086 


1.787 


1.745 


1.745 


1.722 


0.1 


0.2 


0.3 


0.4 


2.229 


1.903 


1.903 


1.884 


1.847 


0.2 


0.2 


0.2 


0.4 


2.312 


2.011 


1.980 


1.933 


1.922 


0.1 


0.3 


0.3 


0.3 


2.286 


1.908 


1.908 


1.908 


1.896 


0.2 


0.2 


0.3 


0.3 


2.370 


2.016 


2.016 


1.980 


1.971 



TABLE VI 

The Smoothness Comparison of Scheduling Algorithms 



The significance of the Huffman coding scheme lies in the 
structure of the Huffman Tree. By the same token, HuRR, 
which is also implemented with the Huffman tree, can provide 
the best performance in terms of smoothness among the 
scheduling algorithms discussed in this section. However, 
HuRR has not yet been proven to be the optimal scheduling al- 
gorithm. It is clear that the key to construct optimal scheduling 
is to explore the structure of the smoothness Li of each state 
Pi in a frame of size F and Kraft's inequality that is satisfied 
by which is mathematically equivalent to the codeword 
length in source coding theorem. 

C. Two-dimensional Scheduling 

The smoothness of scheduling described in the preceding 
section is for a single-server system. However, the capacity 
matrix decomposition of the packet switch system is a two- 
dimensional scheduling aimed at multiple inputs and outputs 
[51]. The generalization of the smoothness to two-dimensional 
scheduling is discussed in this section. 

Recall that the capacity matrix C ~ [cif] of a C(m, n, k) 
Clos network is a k x k matrix, where Cij is the average 
number of tokens assigned to the virtual path Vij between 
input module i and output module j. If the frame size is F, 
then the matrix F ■ C is integer doubly stochastic, such that 

x — y h x — * k 

2^ (kj = m, 2^- (Hj = m. 

In the following discussion of two-dimensional smoothness, 
we will consider, without loss of generality, capacity matrix 
C = [cij] and 2^ c " = 1,VL- Cu = 1 of a crossbar switch 



Hj — 1 > X)j c 

for the sake of simplicity. The two-dimensional entropies of 
the capacity matrix C are defined as follows. 



Definition 6: Let C = [cy] be a doubly stochastic capacity 
matrix; the entropy of input module i is defined by 



Hi = 



N 

e c «j 

3=1 



log Cij, 



(48) 



and the the set of entropies of input modules is represented 
by the column vector 



H 



Hi 
H 2 



H 



N 



= \H U H 2 



, H N ] 



(49) 



Similarly, the entropy of output module j is defined by 



Hj = - E c ^ log dj, 



(50) 



and the set of entropies of output modules is represented by 
the row vector 



H_ — \H_ 1 ,H_2, ■ ■ ■ ,Kn\ ■ 
The entropy of the capacity matrix C is defined by 



(51) 



N 



N 



N N 



h{c) = E^-E^ = -EE c ^ lo s c y • < 52 ) 

8=1 j = l j = l 1=1 

The smoothness of a two-dimensional scheduling is defined 
by the inter-token time in the same manner as the interstate 
time of single server scheduling. Let riy be the number of 
tokens assigned to the virtual path Vij within a frame F, and 

Vl ' V2^ >'''■> V-nt] l> e me sequence of the inter-token times. 
We have 

(53) 



(54) 



for alH, j = 1, • • • , iV. It should be noted that the number of 
tokens assign to a virtual path Vij in a time slot can be more 
than one if the number of central modules m > 1, in which 
case the degenerate inter-token time y k —Q will be allowed. 
The smoothness of two-dimensional scheduling is defined as 
follows. 

Definition 7: For a given scheduling of the capacity matrix 
decomposition C = 2~^iLi fiiPii tne smoothness of the virtual 
path Vij is defined by 



log 



\ 



The smoothness of input module i is defined by 



A7 

Di — ^ ^ Cij dij 
3 = 1 



N 

E ui ' 

3 = 1 



Ci-j log ' 



(55) 



(56) 



and the input smoothness is the column vector 

£i 
D 2 



D = 



D 



N 



= \p u D 2 ,--- ,D N ] T . 



(57) 
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Similarly, the smoothness of output module j is defined by 



N 

D< = V Cijdij 



=1 



i=l 



log' 



and the output smoothness is the row vector 

R=[D 1 ,D 2 ,--- ,D N ] 



(58) 



(59) 



The smoothness of the two-dimensional scheduling is defined 
by 



D 



N 

E 

i=l 



D, 



N 

E^ 



N N 

/] / ] c ijdij- 



(60) 



j=l j=l i=l 

The properties of two-dimension smoothness are similar to 
that of single server scheduling. The proof of the following 
theorem is the same as that of theorem [9] 

Theorem 11: For capacity matrix decomposition C = 
Yl,f=i 4>iPi with frame size F, any scheduling satisfies the 
following smoothness inequalities 

1) The matrix K r — [2 _dij ], called Kraft's Matrix, is 
doubly sub-stochastic such that 



JV 
i=l 



ij < 1; 



2) For input module i 



N 

E^ 

i=i 

, N, we have 



i < 1. 



1,- 
Di>Hi. 

3) For output module j = 1, • • • , TV, we have 

4) The overall smoothness is bounded by 

D > H(C). 



(61) 



(62) 



(63) 



(64) 



m _ 



— , for all i,j — 



The above equalities hold when yY 

1, 2, • • • ,N and fc = 1, 2, • • • , n^. 

The difference between smoothness and entropy can be 
considered as the distortion of delay jitter introduced by 
scheduling. For circuit switch, the capacity matrix C is a 
permutation matrix, in which case we have H(C) = D = 0. 
Another extreme case is the scheduling of uniform capacity 
matrix C = [j*] = ■h Yli=i Pi w ^ f ram e size F — N. 
In this case, the entropy H{C) = AHog /V also equals to the 

(ii) 

smoothness D = N log N because inter-token time y k = N, 
for all i,j = 1, 2, • • • , N and k = 1, 2, • • • , mj for any 
scheduling, in which case the entropy H(C) is the maximum 
because the capacity matrix is completely uniform, and the 
smoothness of any scheduling cannot make it any worse. 

It is clear that if the Kraft matrix 

K r = [2- d ^\ 

is doubly stochastic then the two-dimensional scheduling is 
optimal. However, the existence of the optimal scheduling that 
can reach the entropy H(C) of any capacity matrix C is still 
an open issue. Nevertheless, a rule of thumb is to minimize 
K, the number of permutation matrices in the decomposition 



[52], since the maximal delay bound is on the order of 0{K) 
for most known scheduling algorithms [19], [53]. 
For example, given a doubly stochastic matrix C, 



C = 



0.75 0.125 0.125 

0.125 0.5 0.375 

0.125 0.125 0.5 0.25 

0.375 0.625 



The input entropy H and output entropy H_ of matrix C are 
given respectively by 



and 



#=[ 1.0613 1.4056 1.7500 0.9544] 



if =[ 1.0613 1.4056 1.4056 1.2988 ] 



The entropy of the capacity matrix C is H{C) = 5.1714. One 
possible decomposition is given by 



C 



6 1 

1 4 3 

1 1 4 

3 



10 
10 
10 
1 

10 
10 
1 
10 



10 
10 
10 
1 

10 
10 
1 
10 

1 
10 
10 
10 



in which K = 5 and the permutation matrices are denoted 
by {Pi,P2,--- jA}, respectively. If the WFQ scheduling 
algorithm is applied to this set of permutation matrices, the 
resulting sequence is 

P 1 P 1 PiPiP 2 P 3 P i P 5 

which gives rise to the following two-dimensional scheduled 
tokens 

aaaaaabc 
bbbbcddd 
ccccbbab 
dddddcca 

where each row corresponds to an output module and the sym- 
bol a, b, c and d represents the respective tokens assigned to 
input module 1 to 4. The input smoothness of this scheduling 
is given by 

25= [ 1.2084 1.6997 2.0323 1.3118] f 

and the output smoothness is 

£=[ 1.2084 1.7636 1.6997 1.5805] 

which sum up to D = 6.2522. When the HuRR scheduling 
algorithm is applied to the same set of permutation matrices, 
the resulting sequence is 

PlP 2 PiPiPiP 3 PiP 5 
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which gives rise to the following two-dimensional scheduled 
tokens 

aaabaaac 
bcbdbdbd 
cbcacbcb 
dddcdcda 

This result is obviously smoother than the previous one as 
evident by the following smoothness measures 



and 



D = [ 1.1250 1.4375 1.7902 1.0267 
£=[ 1.1250 1.4375 1.4375 1.3794 



which sum up to D = 5.3794, an improvement consistent 
with the comparison of single server scheduling described in 
the preceding section. 

Suppose we consider another capacity matrix decomposition 



8-C 



+2 



10 
10 
10 
1 

10 
10 
1 
10 



1 
10 
10 
10 

10 
10 
10 
1 



in which the number of permutation matrices K = 4 is 
reduced. Again, applying the HuRR scheduling, the resulting 
sequence is 

P1P4P3P4P2P4P3P4 

which gives rise to the following two-dimensional scheduled 
tokens 

baaacaaa 
cbdbdbdb 
acbcbcbc 
ddcdadcd 

The smoothness measures of this two-dimensional scheduling 
are given by 



and 



D = [ 1.1250 1.4375 1.7500 1.0267 
£=[ 1.125 1.4375 1.4375 1.3392 ] 



which sum up to the overall smoothness D = 5.3392. By 
comparing the output smoothness D_ with that of the previous 
decomposition, we find some improvement in the fourth 
output. Similar improvement can be found at input module 
3, where the optimal smoothness D 3 = 1.7500 = H 3 is 
achieved. The overall smoothness D = 5.3392 is also better 
than the previous D = 5.3794 but still greater than the entropy 
of capacity matrix H(C) = 5.1714. 

Preceding examples indicate that the performance of path 
switching depends not only on the scheduling algorithm but 
also on the number of permutation matrices in the capacity 
matrix decomposition. As we mentioned above that the num- 
ber of permutation matrices in the case of uniform capacity 



matrix with maximal entropy is K = N, and the worst delay 
bound is on the order of O(K) for most known scheduling 
algorithm, it is therefore reasonable to arrive at the conjecture 
that K is on the order of O(N) for the optimal decomposition 
of any capacity matrix, even though the known bound of K in 
the Birkhoff von-Neumann decomposition is A 2 — 2A+2. The 
Kraft matrix and the smoothness measure introduced in this 
section will provide the objective of optimal two-dimensional 
scheduling to be explored in the future, which is expected to be 
a very hard problem because of the large number of possible 
combinations. 

VII. Conclusion 

The parallels between packet switch subject to the con- 
tention and transmission channel in the presence of noise 
are consequences of the law of probability. Input signal to 
transmission is a function of time, the main theorem on noisy 
channel coding being based on the law of large number. On 
the other hand, the input signal to switch is a function of 
space, and both theorems on deflection routing and smoothness 
of scheduling are proved on the ground of randomness. In 
both systems, the random disturbances are tamed by means of 
similar mathematical tools aimed for reliable communication. 

The communication network has gone through two phases 
of quantization in the last century. The first phase was the 
quantization of transmission channels, from analog to digital 
based on the sampling theorem of bandlimited signal. The 
second phase is the quantization of the switching system, from 
circuit switching to packet switching. The comparisons pro- 
vided in this paper demonstrate that the scheduling based on 
capacity matrix decomposition serves the same function as the 
sampling theorem to reduce the complexity of communication. 

The concept of path switching can further be extended 
to the network level to cope with resources contentions. A 
connection-oriented sub-network with predetermined topology 
and bandwidth can be embedded in the current IP network 
for supporting QoS of real-time services such as voice or 
video over IP. The scheduling of path switching in conjunction 
with the routing scheme of multiprotocol label switching 
(MPLS) will provide a platform to support different traffic 
classes of differentiated services (DiffServ) [54], [55]. The 
predetermined paths of label switched sub-network are similar 
to the subway system embedded in the public transportation 
network. It will provide a coherent end-to-end QoS solution 
to the multilevel resource allocation [14] issues arising from 
real-time services. First, the predetermined connection patterns 
at each router are contention-free at the packet level. Next, the 
periodic connection patterns are frame based, the bandwidth 
assigned to each session is fixed at burst level within each 
frame. Finally, the stable capacity of each session is guaranteed 
at call level. The traffic engineering of this internet subway 
system could be a challenge networking research area in the 
future. 

Acknowledgement 

The development of this work was helped by discussions 
over the years with my students Cheuk H. Lam, Philip To, Man 



19 



Chi Chan, S. Y. Liew, Yun Deng, Manting Choy, Jianming 
Liu, Sichao Ruan, and Li Pu, all members of Broadband 
Communication Laboratory at the Chinese University of Hong 
Kong. 

Appendix I 

BOLZMANN MODEL OF PACKET DISTRIBUTION 

Consider an N x N switch as a thermal system. The outputs 
are particles and the energy level of an output, denoted by 
£j, equals to the number of packets destined for that output. 
The distribution of packets over outputs can be determined by 
maximizing the Boltzmann entropy [24]. 

Suppose there are M packets sending from N input ports 
to iV output ports. Let m be the number of output ports with 
energy level £j. An example with N = 8 and M = 4 is shown 
in Fig. [25] where n = 5, n\ = 2, and n 3 = 1. The number of 
possible divisions of N outputs into r distinct energy levels 
of respective sizes n , ■ ■ ■ ,n r is 



nolnil ■ ■ ■ n r l ' 

and the number of possible divisions of M packets into N 
distinct outputs of respective sizes 0, • • • , 0, • • • , r, ■ ■ ■ , r is 



Ml 



(0!)«o(i!)"i . . . ( r |)n r 

Suppose each input can send one packet at most in any time 
slot, then the total number of states of the entire switch is 
given by 

M N\ Ml 

W 



(N-M)lMl n !rii! • • -n r l (0!)™°(l!)"i • • • (r!)™- 

(65) 

subject to the following constraints: 



N = n Q + ra i + n 2 ■ ■ ■ +n r 



and 



(66) 
(67) 



M = Ouq + lni + 2772 • ■ • +rn r . 

The Boltzmann entropy S of the system is given by: 

S = C\nW (68) 

where C is the Boltzmann constant. The maximal entropy can 
be obtained from the following function formed by Lagrange 
multipliers 

f{rH)=\nW + a(£ i n i -N)+p(£ i in i -M). (69) 




«„=5 1DOOOD 

n,=2 Oral 5R1 
«,=1 2 \bfi\ 

Output ports: Particles 

Packets: Energy Quantum 



Fig. 25. Blotzmann model of packet distribution 



Using Stirling's approximation ma;! « x In x — x for the 
factorials, we have 

f(n t ) = N In N - N - (N - M) ln(N - M) + (N - M) 
+ NlnN - N - ^(n i lnn l - m) - n< ln(i!) 



y{J2 m - N) + (3(J2 ini-M). 



(70) 



Taking the derivatives with respect to n*, and setting the result 
to zero and solving for yields the population numbers: 

Tli - — — . (71) 



If the offered load p is uniform on each input, then we have 

(72) 



p = M = E^j = e p 



The probability that there are i packets destined for a particular 
output has the Poisson distribution: 



m 

Pi = TT 



= e~ p ?-, z = 1, 2,3, ... (73) 



The carried load equals the probability that an output port is 
busy 

p'= 1-po = l-e-P (74) 

which is consistent with ([]]). Hence, our proposition on the 
signal power of switch is verified. 

In the above derivation, the assumption that each input can 
only send at most one packet in any time slot can be relaxed. 
As long as packets are distinguishable, the total number of 
states of outputs is 

AM Ml 



W = 



(75) 



n !ni! ■ • • n r l (0!)"" (l!)™i • • • (r!) n <- 

and the distribution of packets is still Poisson. On the other 
hand, suppose all packets are indistinguishable, then the total 
number of states of outputs becomes 



W = 



no!?ii!n2! • • • n r \ 



(76) 



Following the same procedure, we obtain the geometric dis- 
tribution of packets 

1 



Pi = 



(t^)\ 7 = 0,1,2,. 



(77) 



1+p 1+p 

which will be inconsistent with (H). 

We next show that both the signal S p and noise N p of an 
N x N crossbar switch are normally distributed under the 
assumptions of homogeneous input loading p and uniform 
output address. 

The signal power S p is the sum of the following i.i.d. 
random variables: 



Xi 



1 if output i is busy 
otherwise 



with mean = p' and variance Uar[A^] = p'(l — p'), 

where p' is the carried load given in (Q]). It follows from 
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central limit theorem that the signal power S p — Y^i=i ^ 
is normally distributed with mean E[S P ] = Np' and variance 
Var[S p ] = Np'(l - p'). The noise power N p = N - S P 
becomes independent of the signal power S p when N is large. 
By using the similar argument, the noise Np is also normally 
distributed with mean E[N p ] = N(l - p'), and variance 
Var[N p ] = Np'(l - p'). 
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